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Abstract 



In this paper we prove optimality of a certain class of Analog to Digital Converters (ADCs), which can 
be viewed as generalized Delta-Sigma Modulators (DSMs), with respect to a performance measure that can be 



characterized as the worst-case average intensity of the signal representation error. An analytic expression for 
the ADC performance is given. Furthermore, our result proves separation of quantization and control for this 
■ class of ADCs subject to some technical conditions. 



I. INTRODUCTION AND MOTIVATION 



Analog to Digital Converters (ADCs) act as the interface between the analog world and digital 

t>. 

processors. They are present in almost all digital control and communication systems and modern high- 

OV 

speed data conversion and storage systems. Naturally, the design and analysis of ADCs have, for many 

■ 

O years, attracted the attention and interest of researchers from various disciplines across academia and 



industry. Despite the progress that has been made in this field, the design of optimal ADCs remains an 
open challenging problem, and the fundamental limitations of their performance are not well understood. 
This paper is concerned with the latter problem. 

A particular class of ADCs primarily used in high resolution applications is the Delta-Sigma Mod- 
ulator (DSM). Fig. Q] illustrates the classical first-order DSM (TJ, where Q is a quantizer with uniform 
step size. 

An extensive body of research on DSMs has appeared in the signal processing literature. One well 
known approach is based on linearized additive noise models and filter design for noise shaping 0]|- 
O. The underlying assumption for validity of the linearized additive noise model is availability of a 
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Fig. 1. Classical First-Order Sigma-Delta Modulator 



relatively high number of bits. Alternative approaches based on a formalism of the signal transformation 
performed by the quantizer have been exploited for deterministic analysis in Q-O. Some other works 
that do not use linearized additive noise models are reported in |fT0l - [fT2ll . 

In control literature, |[T3l - lfT5l find performance bounds and suboptimal policies for linear stochastic 
control problems using Bellman inequalities with quadratic value functions. The problem is relaxed 
and solved using linear matrix inequalities and semidefinite programming. For references on quantized 
control, please see lfT6l - |[T8i 

In |fT9ll and GUll we provided a characterization of the solution to the optimal ADC design problem 
and presented a generic methodology for numerical computation of sub-optimal solutions along with 
computation of a certified upper bound and lower bound on the performance, respectively. 

Fig. [2] illustrates the setup we use for measuring the performance of the ADC. The performance of an 
ADC is evaluated with respect to a cost function which is a measure of the intensity of the error signal e 
(the difference between the input signal r and its quantized version u) for the worst case input sequence. 
The error signal is passed through a shaping filter which dictates the frequency region in which the 
error is to be minimized. Furthermore, we show that the dynamical system within the optimal ADC is 
a copy of the shaping filter used to define the performance criteria. 



ADC 



u e 

-^o 



Shaping 
Filter 



Fig. 2. Setup Used for Measuring the Performance of the ADC 



In lfT9ll we also presented an exact analytical solution to the optimal ADC for first-order shaping 
filters, and showed that the classical first-order DSM (Figure [Q) is identical to our optimal ADC. This 
result proved the optimality of the classical first-order DSM with respect to the adopted performance 
measure, and was a step towards understanding the limitations of performance. In this paper we provide 



the optimal solution for higher order shaping filters subject to certain technical conditions and prove 
optimality of some higher order DSMs. 



Notation and Terminology: 

• Given a set P, £+(P) is the set of all one-sided sequences x with values in P, i.e. functions 

x : Z + ^ P. 



II. Problem Formulation 
The problem setup in this section is taken from [fP9ll . 



A. Analog to Digital Converters 

In this paper, a general ADC is viewed as a causal, discrete-time, non-linear system \I>, accepting 
arbitrary inputs in the [—1,1] range and producing outputs in a fixed finite subset U C M, as shown in 
Fig. [3l We assume maxf/ > 1 and mmU < — 1. 



r[n] e [-1,1] 



n G Zj 



u[n] G J7 



n G Z_i 



Fig. 3. Analog to Digital Converter as a Dynamical System 



Equivalently, an ADC is defined by a sequence of functions T„ : [— 1, 1] \-t U according to 

^ : ix [n] = T n (r[n], r[n — 1], • • • , r[0]) , n G Z + . 
The class of ADCs defined above is denoted by y v . 



(1) 



B. Asymptotic Weighted Average Intensity (AWAI) of a Signal 

Let M+ be an even, non-negative, and monotonically nondecreasing function on the positive 

reals; and G (z) be the transfer function of a strictly causal LTI dynamical system L G with input w and 
output q: 

{x[n+ 1] = Ax\n] +Bw[n], x\0] = 0, 
(2) 
q[n] = Cx[n] 



where A, B, C are given matrices of appropriate dimensions. The Asymptotic Weighted Average 
Intensity r\ G ^ (w) of signal w with respect to G (z) and is given by: 

r]G,4>{w) = limsup— YV(g[n]) . (3) 

Examples of functions to consider are: 0(g) = \q\ and 0(g) = \q\ 2 . We assume without loss of 
generality that CB ^ 0. Indeed, since T)g,4> does not change if G(z) is replaced by zG(z), i.e. if q[n) is 
replaced with g[n + 1] in ©, the case when CB = can be reduced to the case CB ^ by extracting 
a delay from L G . 

C. ADC Performance Measure 

The setup that we use to measure the performance of an ADC is illustrated in Fig. [4} The performance 
measure of ^ G yu, denoted by J G ^ (\1>) , is the worst-case AWAI of the error signal for all input 
sequences r G £+([— 1, 1]), that is: 

JgA*)= su P (r)). (4) 
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Fig. 4. Setup Used for Measuring the Performance of the ADC 



D. ADC Optimization 

Given L G and 0, we consider ^ G y v an optimal ADC if J Gj(j) ($ D ) < J" G ^ (*) for all * G 3V 
The corresponding optimal performance measure 7^ (Z7) is defined as 

7o> , (17) = inf Jq* m • (5) 

WfcJ/jy 

III. Our Approach 

We search for the optimal ADC within the class of time invariant state-space models and associate 
the optimal ADC design problem with a full-information feedback control problem. We show for a 
certain class of ADCs that the setup depicted in Figure [5] is an optimal ADC architecture. The function 



K : M m x [—1, lj i — y JJ is said to be an admissible controller if there exists 7 G [0, 00) such that every 
triplet of sequences (x^,u,r) satisfying 



xqj [n + 1] = Ax$, [n] + Br [n] — Bu [n] , x^ [0] = 0, 
u [n] — K (xq, [n] , r [n]) , 
q^f [n] = Cxy [n] , 



also satisfies the dissipation inequality 



N-l 



sup Y](0(g*[n])-7) <oo 

N,re£+(l-hq) n=0 



(6) 
(7) 
(8) 

(9) 



Note that if © holds subject to ©-([8]), then J G ^ (\&) < 7. Let 7 be the maximal lower bound of 7, 
for which an admissible controller exists. Then K is said to be an optimal controller if © is satisfied 
with 7 = 7o- 
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Fig. 5. Full State-Feedback Control Setup 



IV. Main Result 

Consider the ADC optimization problem presented in Section [XT] with L G defined by © with CB 7^ 0. 

For 5 e (0, 2] and MeNU {00}, define the set U M and function K M :R-^U M as 

U M = {md \meZ, \m\< M} (10) 

Km(0) = rnin < arg min \9 — u\ \ . (11) 

ueUu 



(12) 



Consider the ADC $ G y Uu defined by 

[n + 1] = Axy [n] + Br [n] — Bu [n] , 
<?§M = Cx$[n] 
x$ [0] =0 
with the control law 

u[n] = K M ((CB)~ l CAx$[n) + r[n)) . (13) 

We show in Theorem Q] below that if M is large enough and 5 is small enough, then the ADC defined 
above is optimal. The control decision u[n] in (fT3l minimizes |g$[n + l]|. An interpretation of Theorem 
[T]is that a greedy algorithm is optimal subject to certain conditions. Let 

k k 

q$[n + 1] = ^a^n - i] + ^ b i( r [ n ~ j\ ~ u [ n ~j])- ( 14 ) 

i=0 j=0 

be the difference equation which is equivalent to (fT2l) . Let T be the causal LTI system with transfer 
function 

— • (15) 



F(z) 



E 6 

3=0 



Let {q}^ be the unit sample response of system (PT41) . i.e. 

F(z) — y^ciz~ l , for \z\ > R 
1=0 

where R G R is the maximal absolute value of the largest pole of F(z) in (Q3 



(16) 



Theorem 1: Let $ G 3^ be the ADC defined by ([l2V([T3l with CB ^ and iT M defined by 
(O— (HB- Let 







CO 

En 

Z=0 



\cb\- + 1 +Efo 

where {a;}fL and {&/}jL are defined by ([T4|) and {q}^ is defined by £[5])-CE6])- Let M5 be such 
that M8 > 1 and 

M5 > /3 - 5. (17) 

Let / : [0, 00) —7- [0, 00) be a monotonically nondecreasing function and <f>(q) = f (\q\) . Then ^ is an 
optimal ADC in the sense that 



Jgj (*) > = (|CB|5/2) W G y Um . 



(18) 



Proof: Please see the Appendix. ■ 
Remark 1: We showed in |[P9l that the first-order DSM in Figure [T] is optimal with respect to the 

shaping filter L G — l/(z — 1) with any uniform quantizer Q with MS > 1. 
Remark 2: For L G = zj (z— l) 2 with any uniform quantizer Q with step size 5 < 2 and the magnitude 

of the largest value of the quantizer being larger than 1 + 5, the second-order DSM is optimal. 

The optimal ADC architecture presented in Figure [5] along with the optimal control law given in (TT3T ) 
can be equivalently represented by Figure [6] and equation ( fl9l ), where Q is a uniform quantizer with 
step size 5 and saturation level MS satisfying (flTI) and G(z) is the transfer function of the shaping filter 
Lq- Furthermore, Figure [6] has a DSM architecture, thus with a proper selection of Lq as the shaping 
filter, many standard DSMs that satisfy the conditions in Theorem Q] are proven optimal. 
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Fig. 6. Optimal ADC Architecture, where G(z) — C(zl — A) is the transfer function of Lg 



H{z) = (CB)- l zG(z) - 1 = (CB)- l C(zI - A)~^AB (19) 

That is, if the magnitude of the largest value of the quantizer output is large enough and quantization 
step size is small enough, then the greedy algorithm is the optimal output for the ADC. This shows 
separation of quantization and control for this problem, subject to inequality (TT7T ). 

V. CONCLUSION 

In this paper, we showed optimality of a certain class of ADCs (which were shown to have DSM 
like architecture) subject to some conditions and provided an analytic expression for the performance. 
We showed that there is separation of quantization and control, i.e. in the absence of quantization, the 
obvious choice for the optimal control law is proven to be the optimal control law given quantization, 
when certain technical conditions are met. 



VI. APPENDIX 

Proof of Theorem 3} Let us begin by showing that with the control law given in (fT3l with M = oo 
we have: 

\q$[n]\<\CB\5/2, VnGZ + , (20) 
Indeed, for n — 0, inequality fl20|) follows from the initial condition in (fT2j) . For n > 0, 

q^[n + 1] = CB(u/[n] - K(w[n])), 

where w[n] = (OB^CAz^fra] + r[n]. Since |0 - K{&)\ < 5/2 for all 6 G R, we have © for all 
n > 0. 

The next step is to use the bound < \CB\8/2 to show that \u[n}\ < (3. Rearranging (PT4l) . 

taking absolute value from both sides, and using the triangle inequality yields: 

,5 



k 
j=0 



< 



k \ k 



,i=0 / j=0 



If Yl'j=o b j u [ n ~ i] * s me input signal to the system J 7 with transfer function F(z) defined in (TT51) , 
then the output u[n] is bounded in magnitude by 

\u[n]\<f3 (21) 

A sufficient condition for \u[n]\ < M5, is given by (fTTT) . (|2T1) . and u G U^. Therefore (fTTT) implies 
©. 

Since both systems Lg and L$ have the same input and x§ [0] = ,x [0] = 0, condition (120]) implies 
that 

|g[n]| < |CB|5/2, Vn G Z+. 

Therefore, 

AT 

sup V (0 (g [n]) - <f>(\CB\6/2)) < < oo, 
JV.re[-i,i] n=0 

which implies that 

J G ^)<<t>{\CB\8/2). (22) 

In order to complete the proof, we need to show that no ADC can achieve a better performance than 
(f> (\CB\S/2). It is sufficient to show that for all \& G there exists an input sequence r such that 

\<to Ml > \CB\6/2, Vn G Z+\{0}. (23) 



Define function p : M. m — > Z by 

"2k + 1 



= min < arg min 

1 fcez 



2 



5 - (CB^CAx 



(24) 



When r[n) is given by 

r[nl = 2p(x[n]) + 1 <5 - (OB^C^n], (25) 



2 

we have r[ra] G [—1, 1] (since 5 G (0, 2]) and 



\qv[n + 1]| 

for all n G Z + , because G A;5. Hence 



Cfl( 2 ^ 1)+1 tf-«[n] 



> |C£|5/2 (26) 



JgA*)> <t>{\CB\8/2). (27) 
Inequalities (|22l) and (|27l) complete the proof. 
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